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Biomolecule Visualization with Ellipsoidal Coarse-graining (BioVEC) is a tool for visualizing molec- 
ular dynamics simulation data while allowing coarse-grained residues to be rendered as ellipsoids. 
BioVEC reads in configuration files, which may be output from molecular dynamics simulations that 
include orientation output in either quaternion or ANISOU format, and can render frames of the 
trajectory in several common image formats for subsequent concatenation into a movie file. The 
BioVEC program is written in C++, uses the OpenGL API for rendering, and is open source. It is 
lightweight, allows for user-defined settings for and texture, and runs on either Windows or Linux 
platforms. 



1 Introduction 

The use of coarse-graining methods in biomolecular 
simulation is progressively increasing [1], mainly to cir- 
cumvent the current time-scale problem that exists for 
all- atom molecular dynamics simulations, and to allow 
for the computational modeling of ever larger systems 
which address biologically relevant questions. How- 
ever as the coarse-grained constituents encompass more 
atoms, the resulting residual objects become poorly 
approximated by spheres with corresponding isotropic 
potentials, while the steric character of the residues 
becomes more important. Such a scenario would exist 
for the coarse-graining of bases in DNA, or the coarse- 
graining of helices in a protein. 

For these reasons, interest in ellipsoidal coarse- 
graining, perhaps the simplest non-spherical general- 
ization, has recently increased [2-9]. Interest in indus- 
trial applications such as the manufacture of nanoparti- 
cle films has also motivated studies of ellipsoidal coarse- 
graining [10, 11]. 

The molecular dynamics (MD) of rigid objects raises 
some new practical challenges, including a 
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singularity-free representation of rigid body rota- 
tion [12-15] and corresponding energy conserving 
scheme for symplectic integration [16-18]. This is best 
done using quaternions [19,20], which MD simulation 
programs such as LAMMPS [7] currently allow one to 
employ. 



Quite apart from the issues of coarse-graining and 
molecular dynamics, visualization tools for rendering 
thermal ellipsoids representing Debye- Waller factors 
have enjoyed a long and successful tradition [21]. How- 
ever, in spite of widely used standards in crystallogra- 
phy, as well as the above-mentioned progress in coarse- 
graining and molecular dynamics of rigid bodies, to 
the authors' knowledge no currently available program 
allows for the straightforward visualization of the dy- 
namical evolution of coarse-grained systems with el- 
lipsoidal constituents. To facilitate our own analysis 
of such coarse-grained models, we have decided to de- 
velop our own program. The program is portable to 
either Windows or Linux systems, and could in prin- 
ciple be incorporated into many of the more versa- 
tile and widely-used visualization programs such as 
VMD [22], Molmol [23], Molscript [24], PyMOL [25], 
RasterSD [26], Jmol [27], and Rasmol [28]. 
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2 Methods 

The program BioVEC presented in this paper, is a 
program to visuahze the results of MD simulations of 
biomolecules that have been coarse-grained with ellip- 
soids. The functionality of BioVEC is simple and basic, 
as its purpose is not to compete with current molecu- 
lar visualization packages but specifically to allow the 
scientist to visualize dynamical processes involving el- 
lipsoidal coarse-graining. BioVEC automatically cre- 
ates graphical images, which can be compiled to movie 
files in an external editor, such as Slide Show Movie 
Maker [29] (Windows) or MJPEG Tools [30] (*NIX). 
It is also possible for the user to manually step through 
the simulation, and to save the current view of the 
molecule to an image file in several formats, including 
png, bmp, and jpg. The representation used in BioVEC 
differs from techniques such as cartoon and ribbon rep- 
resentation, as these techniques aim to smooth out de- 
tails, in all-atom simulations for example. The aim of 
BioVEC is rather to show all the features of already 
coarse-grained simulation data. 

BioVEC reads in molecular dynamics simulation 
data from a file containing the positions of the coarse- 
grained species and the orientations of the ellipsoids 
for all time steps in the molecular dynamics simula- 
tion. The input format for BioVEC is based on the 
output dump file format in LAMMPS [7], which is of 
the form: 

species-no species-type position X 3 
orientation X 4 

for each time step. The program also reads in a file 
with information on the topology of the molecule, in 
order for the program to display bonds between the 
species. In the current version of the program, cova- 
lent bonds are not allowed to break or form between 
different time steps. The information in the topology 
file comes from the LAMMPS input file, and is of the 
form 

bond-no bond-type Ist-species 2nd-species 
Ist-origin 2nd-orgin 

where the bond goes from the Ist-origin on the 
Ist-species to the 2nd-origin on the 2nd-species, 
corresponding to the species-no in the simulation 
data file. Each ellipsoid can have an arbitrary num- 
ber of bond origins, as specified in the input file. The 
Ist-origin and 2nd-origin entries in the topology 
file are optional if the species only have one bond ori- 
gin each. 

The user supplies the names of the simulation data 
and topology files in an input file with information 
about the coarse-graining of the system. This file 
should contain the number of coarse-grained species, 
the radii of the different ellipsoids and spheres, as well 



as the vector from the centre of mass of the ellipsoid 
to the location of the bond connecting the ellipsoidal 
residue to its covalently-bonded partner. Spheres are 
treated as ellipsoids with all three radii equal. Infor- 
mation about the colouring and texture of the species 
is also entered into the input file. 

The BioVEC program can also read and visual- 
ize ANISOU anisotropic temperature factor records in 
PDB standard form [31]. 

2.1 Quaternions 

Quaternions [19] are frequently used in computer 
graphics to represent rotations and orientation in 3- 
dimensional space. One of the main advantages of 
quaternions over other representations, such as Euler 
angles, is the absence of singularities in the quaternion 
representation. 

A quaternion is a four-dimensional extension to the 
complex numbers, 

q = gou + qii + q23 + qs^- (1) 

where go, ^i, ^2, and qs are real numbers, and u, i, j, 
and k are such that = j ^ = = ij k = — = — 1, 
and ij = — j i = k. Quaternions are often written in a 
more compact vector form, 

q = [^0,^1,^2,^3], (2) 

where the quantities go, ^i, ^2, and qs again are real 
numbers. 

The relation between the quaternion representation 
and the rotation and angle a about a unit axis, n = 
XX + + is given by (see for example [32]) 

s = {ql^ql^ql)^^\ 
X = qi/s, 

y = q2/s, (3) 

z qs/s, 
a = 2 cos~^{qo) , 

for s 7^ 0. Note that for the qi to represent a rotation 
Yl^=o Qi = The orientation of the different ellipsoids 
are read by BioVEC from the simulation dump-file in 
vector form (Eq. [2|), together with the position and 
ellipsoid type. 

2.2 Bond vectors 

The bonds between ellipsoidal species are drawn be- 
tween the same points on the ellipsoids as were used 
in the MD simulation. The vectors from the centre of 
mass of the different types of ellipsoids to the location 
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Figure 1: Representation of the bond vectors. The bond B 
is formed between the points bi and 62, given by the bond 
vectors Pi and P2. A second bond vector for the right 
ehipsoid is indicated by a red arrow. The bond vectors 
are given relative to the principal axis (dotted lines) of the 
ellipsoids. 

where the covalent bond joins the ellipsoid (Fig. [1]), 
are supplied by the user in the configuration file. The 
vectors are given in the principle (body-centered) basis 
of the ellipsoid for each bond origin on each ellipsoid 
species. 

3 Implementation 

The BioVEC program is written in C++, using the 
OpenGL API for rendering. The GLUT library is used 
for creating menus. The user input is command line 
driven, and the interaction with the program is via 
keyboard commands and mouse menus. 

Graphical images from the program can be saved 
to file in several different formats through the DevIL 
library [33]. Images can be saved automatically for 
each time step of the simulation as the program reads 
through the data file, or the current view of the 
molecule can be saved by the user. 

The BioVEC source can be compiled and run on 
both the Windows and Linux platforms. 

4 Results and Discussion 

Figure [2] shows an example of BioVEC running un- 
der Windows XP, depicting the console window, the 
OpenGL window, and the GLUT mouse menu. The 
molecule can be rotated and zoomed with either the 
keyboard or the mouse, and the time stepping can be 
done automatically or one step at at time by the user. 
The user can let the program automatically print the 
current view to an image file, or can choose to manu- 
ally print the current view with a press of a key. This 




Figure 2: View of the BioVEC interface, running on Win- 
dows XP. 



way stereo views of the molecule can be created. 

The advantage of using a program, such as BioVEC, 
capable of visualizing ellipsoids, is seen in Fig.[3l where 
the all-atom model of single-stranded DNA is shown 
alongside with a coarse-grained ball-and stick model, 
and an ellipsoidal model, both visualized in BioVEC. 
In the all-atom model the orientation of the bases can 
be seen, while this information is lost when display- 
ing the coarse-grained DNA strand with the ball-and 
stick model. The ellipsoidal visualization model used 
in BioVEC, however, retains the orient ational informa- 
tion of the all-atom representation. 

Figure [H shows a collapsed globule of single- 
stranded DNA with block-copolymeric sequence 
A(xl5)- T(xl5). Information that would not have 
been seen with a coarse-grained ball-and-stick render- 
ing, such as partial hair-pin formation along with ori- 
entation of the stacked base pairs, is clearly visualized 
in BioVEC. 

The input format of the program also allows for mul- 
tiple bond origins, as shown in Fig. [5l where a 4-helix 
bundle is coarse-grained so that the helices are mod- 
eled by effective prolate ellipsoids. Investigating the 
folding mechanism of such a coarse-grained system al- 
lows one to investigate the assumptions of the diffusion- 
collision model [34], for example. Renderings with mul- 
tiple bond origins may be useful for characterizing the 
constraints due to intermolecular chemical bonds be- 
tween biomolecules. 

Separate strands of coarse-grained polymers can be 
visualized in BioVEC. An example of this is shown 
in Fig. [H where a polymer glass melt, consisting of 
100 polymer strands with ten atoms in each strand 
[35], is visualized. By giving all the atoms on a single 
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Figure 3: Comparison of three different visualization mod- 
els for single- stranded DNA. All- atom CPK representation 
rendered in VMD; coarse-grained ball-and-stick model; and 
coarse-grained ellipsoidal model, both rendered in BioVEC. 




Figure 4: BioVEC visualization of single stranded DNA 
consisting of adenine and thymine, forming a hairpin. 




Figure 5: BioVEC visualization of the homodimeric ROP 
protein 1Y07 four helix bundle, with the helices repre- 
sented by ellipsoids. Multiple bond origins are present for 
this system, and represent the points where the backbone 
chain enters and exits the helices. 




Figure 6: BioVEC visualization of polymer glass [35]. The 
simulation contains 100 polymers with ten atoms in each 
polymer. Each polymer is visualized with different orien- 
tation of the texture to simplify the interpretation of the 
image. 

polymer the same orientation, the separate polymers 
in the glass can be identified. The information about 
the bonds, and thereby the separate polymers, is given 
in the topology file. 

BioVEC can also read and visualize PDB files with 
ANISOU data. Figure [71 shows the thermal ellipsoid 
representation of GMl-pentasaccharide from a com- 
plex with the cholera toxin B-pentamer refined with 
anisotropic displacement parameters at 1.25 A resolu- 
tion (PDB data taken from a RasterSD [26] example 
file). 

BioVEC can be used to visualize ellipsoids for a wide 
range of applications. Figure [8] shows the orientation 
of the dielectric tensors at a cross-section through the 
centre of ubiquitin [36]. The ellipsoids represents the 
local dielectric constant, with the orientation given by 
treating the dielectric tensor as a rotation matrix. The 
sizes of the principal axis are given by the reciprocals 
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Figure 7: BioVEC visualization of the thermal anisotropy 
of GMl-pentasaccharide from a complex with the cholera 
toxin B-pentamer [26]. 




Figure 8: BioVEC visualization of the dielectric tensors at 
a cross-section of ubiquitin. [36]. The principal axis of the 
ellipsoids are given by the inverse of the eigenvalues of the 
dielectric tensor. 

of the eigenvalues, to emphasize the low dielectric con- 
stant in the centre of the protein. 

5 Conclusion 

The BioVEC program presented here is a simple but 
useful tool for visualizing bio-molecules that are coarse- 
grained with ellipsoids. There are many program pack- 
ages available for molecular visualization, but none 
of these can, to our knowledge, display biomolecules 
coarse-grained with ellipsoids. Some programs, such 
as RasterSD [26] and PyMOL [25], allow for the vi- 
sualization of anisotropic temperature factors in the 
form of ellipsoids. However, they are currently not 
suitable for straight-forward visualization of the dy- 
namics of coarse-grained ellipsoidal systems with the 
orientation represented by quaternions. The BioVEC 
program therefore focuses on this feature, making it 
a small and user-friendly program for visualization of 
coarse-grained molecular dynamics data. 

Future revisions of BioVEC should include a graph- 



ical user interface (GUI), in order to improve both the 
functionality and the ease-of-use. Improvement and 
optimization of the rendering algorithm should accel- 
erate rendering of large molecules, or long simulation 
runs. 

BioVEC is open source, and is distributed un- 
der the GNU GPL licence. The source code and 
Windows executable are available for download at 
|h ttp: / / www.phas.ubc.ca/^steve/BioVEC /[ 
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